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Inhibitor: Gp195 
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Inhibitor: YP42 
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_LI ILM I I LI 

BAMHI ECORV HIND3 AFL2 AKA3 

SAC2 AHA3 AHA3 AHA3 AHA3 
ECORV ECORI 
S5PI 




LI I i I LLI I 

AHA3 ASEl BSABl 

BGL2 PSTI 
AHA3 SALI 
ASU2 BAMHI 
BSTBl 



MetTrpSerTrpLysCysLeuLeuPheTrpAlaValLeuValThrAla 
1 GGATCCACTGGGATGTGGAGCTGGAAGTGCCTCCTCTTCT6GGCTGTCCTGGTCACAGCC 
CCTAGGTGACCCTACACCTCGACCTTCACGGAGGAGAAGACCCGACAGGACCAGTGTCGG 

1 BAMHI, 

V 

ThrLeuCysThrAlaAlalleSerValThrMetAspAsnlleLeuSerGlyPheGluAsn 
6 1 ACACTCTGCACCGCGGCGATATCTGTCACAATGGATAATATCCTCTCAGG ATTTGAAAAT 
TGTGAGACGTGGCGCCGCTATA6ACAGTGTTACCTATTATAGGAGAGTCCTAAACTTTTA 

71 SAC2, 78, ECORV, 

GluTyrAspVallleTyrLeuLysProLexiAlaGlyValTyrArgSerLeuLysLysGln 
121 G AATATGATGTTATATATTTAAAACCTTTAGCTGGAGTATATAG AAGCTTAAAAAAACAA 
CTTATACTACAATATATAAATTTTGGAAATCGACCTCATATATCTTCGAATTTTTTTGra 

138 AHA3, 165 HIND3, 

IleGluLysAsnllePheThrPheAsnLetiAsnZieuAsnAspIlelieuAsnSerArglieu 
181 ATTG AAAAAAACyiTTTTTACATTTAATTTAAATTTGAACGATATCTTAAATTCACGTC 

TAACTTTTTTTGTAAAAATGTAAATTAAATTTAAACTTGCTATAGAATTTAAGTGCAGAA 

207 AHA3, 220 ECOPV, 238 KFL2, 

LysLysArgLysTyrPheLeuAspValLeuGluSerAspI/euMetGlnPheLysHisIle 
24 1 AAGAAACG AAAATATTTCTTAGATGTATTAGAATCTGATTTAATGCAATTTAAACATATA 
TTCTTTGCTTTTATAAAGAATCTACATAATCTTAGACTAAATTACGTTAAATTTGTATAT 

251 SSPI, 289 AHA3, 

SerSerAsnGluTyrllelleGluAspSerPheLysLeuLeuAsnSerGluGlnLysAsn 
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301 TCCTCAAATGAATACATTATTG AAGATTCATTTAAATTATTG AATTCAG AACAAAAAAAC 
AGGAGTTTACTTATGTAATAACTTCTAAGTAAATTTAATAACTTAA6TCTTGTTTTTTTG 

331 AHA3, 342 ECORI, 

ThrLeuLeuLysSerTyrLysTyrlleLysGluSexValGluAsnAspIleLysPheAla 
361 ACACTTTTAAAAAGTTACAAATATATAAAAGAATCAGTAGAAAATGATATTAAATTTGCA 
TGTGAAAATTTTTCAATGTTTATATATTTTCTTAGTCATCTTTTACTATAATTTAAACGT 

366 AHA3, 

GlnGluGlylleSerTyrTyrGluLysValLeviAlaLyeTyrLysAspAspLeuGluSer 
421 CAG6AAGGTATAAGTTATTATGAAAAGGTTTTAGCGAAATATAAGGATGATTTAGAATCA 
GTCCTTCCATATTCAATAATACTTTTCCAAAATCGCTTTATATTCCTACTAAATCTTAGT 

IleLysLysVallleLysGluGlxiLysGlxiIiysPheProSerSerProProThrThrPro 
481 ATTAAAAAAGTTATCAAAGAAG AAAAGGAG AAGTTCCCATCATCACCACCAACAACACCT 
TAATTTTTTCAATAGTTTCTTCTTTTCCTCTTCAAGGGTAGTAGTGGTGGTTGTTGTGGA 

ProSerProAlaLysThrAspGluGlnLysLysGluSerLysPheLeuProPheLeuThr 
541 CCGTCACCAGCAAAAACAGACXS AACAAAAGAAGGAAAGTAAGTTCCTTCCATTTTTAACA 
GGCAGTGGTCGTTTTTGTCTGCTTGTTTTCTTCCTTTCATTCAAGGAAGGTAAAAATTGT 

AsnlleGluThrLeuTyrAsnAsnLeuValAsnLysIleAspAspTyrLeuIleAsnLeu 
601 AACATTGAG ACCTTATACAATAACTTAGTTAATAAAATTG ACGATTACTTAATTAACTTA 
TTGTAACTCTGGAATATGTTATTGAATCAATTATTTTAACTGCTAATGAATTAATTGAAT 

649 PACl, 

LysAlaLysIleAsnAspCysAsnValGluLysAspGluAlaHisValLysIleThrLys 
661 AAGGCAAAG ATTAACGATTGTAATGTTGAAAAAG ATG AAGCACATG TTAAAATAACTAAA 
TTCCGTTTCTAATTGCTAACATTACAACTTTTTCTACTTCGTGTACAATTTTATTGATTT 

LeuSerAspLeuLysAlalleAspAspLysIleAspXieuPheLysAsnHlsAsnAspPhe 
721 CTTAGTGATTTAAAAGCAATTGATGACAAAATAGATCTTTTTAAAAACCATAACGACTTC 
GAATCACTAAATTTTCGTTAACTACTGTTTTATCTAGAAAAATTTTTGGTATTGCTGAAG 

729 AHA3, 753 BGL2, 760 AHA3, 778 ASU2 BSTBl, 

GluAl a IleLy sLy sLeuIleAsnAspAspThrLysZjy sAspMetLeuGlyLysLeuLeu 
781 GAAGCAATTAAAAAATTGATAAATGATGATAC6AAAAAAGATATGCTTGGCAAATTACTT 
CTTCGTTAATTTTTTAACTATTTACTACTATGCTTTTTTCTATACGAACCGTTTAATGAA 

SerThrGlyLeuValGlnAsnPheProAsnThrZlelleSerLysLeulleGluGlyLys 
841 AGTACAGGATTAGTTCAAAATTTTCCTAATACAATAATATCAAAATTAATT6AAGGAAAA 
TCATGTCCTAATCAAGTTTTAAAAGGATTATGTTATTATAGTTTTAATTAACTTCCTTTT 

885 ASEl, 

PheGlnAspMetl/euAsnlleSerGlnHlsGlnCysValLysLysGlnCysProGluAsn 
901 TTCCAAGATATGTTAAACATTTCACAACACCAATGCGTAAAAAAACAATGTCCAGAAAAT 
AAGGTTCTATACAATTTGTAAAGTGTTGTGGTTACGCATTTTTTTGTTACAGGTCTTTTA 

SerGlyCysPheArgHlsLeuAspGluArgGluGluCysLysCysLeuLeuAsnTyrLys 
961 TCTGGATGTTTCAGACATTTAG ATG AAAGAGAAGAATGTAAATGTTTATTAAATTACAAA 
AGACCTACAAAGTCTGTAAATCTACTTTCTCTTCTTACATTTACAAATAATTTAATGTTT 
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GlnGluGlyAspLysCysValGlxiAsnProAsnProThrCysAsnGluAsnAsnGlyGly 
1021 CAAGAAG6T6ATAAAT6TGTTG AAAATCCAAATCCTACTTGTAACG AAAATAATGGTGGA 
GTTCTTCCACTATTTACACAACTTTTAGGTTTAGGATGAACATTGCTTTTATTACCACCT 

CysAspAlaAspAlaLysCysThrGluGluAspSerGlySerAsnGlyJjysLysIleThr 
1081 TGTGATGCAGATGCCAAATGTACCGAAGAAGATTCAGGTAGCAACGGAAAGAAAATCACA 
ACACTACGTCTACGGTTTACATGGCTTCTTCTAAGTCCATCGTTGCCTTTCTTTTAGT6T 

CysGluCysThrLysProAspSerTyrProLeuPheAspGlyllePheCysSerAM AM 
1141 TGTGAATGTACTAAACCTGATTCTTATCCACTTTTCGAT6GTATTTTCTGCAGTTAGTAG 
ACACTTACATGATTTGGACTAAGAATAGGTGAAAAGCTACCATAAAAGACGTCAATCATC 

1159 BSABl, 1168 PSTI, 1200 SALI, 



1201 TCGACCCTTGGAAGGATCC 
AGCTGGGAACCTTCCTAGG 

1214 BAMHI, 



1261 
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DNA AND AMINO ACID SEQUENCE OF BVp42-M 



attggatccactaaa 

13 atgtggtcttggaagtgtcttttattctgggctgtcttggtgacc 

MWSWKCLLFWAVLVT 
58 gccactctttgcacagcagcgatctctgttactatggacaacatc 

ATLCTAAISVTMDNI 
103 ctcagtggcttcgagaacgagtacgacgtaatctacctaaagccc 

LSGFENEYDVIYLKP 
14 8 cttgccggtgtctaccgttcattgaagaaacagatagaaaagaat 

LAGVYRSLKKQIEKN 
193 attttcacgttcaacctcaacctaaatgacatcctcaactcgcgc 

IFTFlSfLNLNDILNSR 
238 ctcaagaagcgaaaatacttcctcgacgtgttggaatccgacctt 

LKKR KYFLDVLESDL 
2 83 atgcaatttaagcacattagctctaacgagtacatcatagaggac 

MQFKHISSNEYIIED 
328 agcttcaagctcttgaattcagaacagaagaacaccctcctaaag 

SFKLLNSEQKNTLLK 
37 3 tcctacaaatacattaaggagtctgttgagaacgacatcaagttc 

SYKYIKESVENDIKF 
418 gcccaggaaggaattagctactatgagaaagtcctggctaaatac 

AQEGI SYYEKVLAKY 
4 63 aaggacgacttggaaagcattaagaaggtaatcaaagaagagaag 

KDDLES IKKVIKEEK 
508 gaaaagtttccgagctctccacccacaactcccccatcgcctgca 

EKFPSSPPTTPPSPA 
553 aagaccgacgagcagaaaaaagaaagtaagttccttccattcctc 

KTDEQKKESKFLPFL 
598 accaacatcgaaactctatataacaacctggtgaacaagattgat 

T N i ■ E T L Y N N L V N K I 0" 
64 3 gactacttaatcaacttgaaggcgaaaattaatgactgtaacgtc 

DYLINLKAKINDCNV 
68 8 gaaaaggatgaagcccacgttaagatcaccaagctttccgatctc 

EKDEAHVKI TKLSDL 

7 33 aaagccatcgacgataagattgacctgtttaagaaccacaacgat 

KAIDDKIDLFKNHND 
77 8 ttcgacgcaatcaaaaagttgatcaacgacgatactaagaaagac 

FDAIKKLINDDTKKD 
823 atgcttggaaaactgctgtcgacaggcttggtccaaaacttcccg 

MLGKLLSTGLVQNFP 

8 68 aacaccattataagcaagctgatcgaaggaaagtttcaggatatg 



an 
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NTIISKLI EGKFQDM 
913 ctgaacatctctcagcatcaatgcgtgaagaagcaatigtcccgag 

LNISQHQCVKKQCPE 
958 aattcaggttgcttccgccacttagacgaaagggaggaatgtaaa 

NSGCFRHL DEREECK 
1003 tgcctgctgaattataaacaggaaggagacaagtgcgtagagaat 

CLLNYKQE GDKCVEN 
104 8 cctaacccaacctgtaacgaaaataacggtggctgcgatgctgac 

PNPTCNENNGGCDAD 
1093 gctaagtgtaccgaggaggacagcggttccaatggcaagaaaata 

AKCTEEDS GSNGKKI 
1138 acttgcgaatgcacgaagcccgatagttaccctctcttcgacggt 

TCECTKPD S YPLFDG 
1183 atcttctgctcc 

I F C S 

ccacctcatcatcatcatcatcattaataaggtaccta 
PPHHHHHH** 

FIG.- 12B 
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1 GGATCCCTAAAATGTGGAGCTGGAAGTGCCTCCTCTTCTGGGCTGTCCTG 
MWSWKCLLFWAVL 
5 1 GTCACAGCCACACTCTGCACCGCG GGCGCC GCAGTTU^CTCCTTCCGTAAT 
VTATLCTAGAAVTP SVI 
101 TGATAACATACTTTCTAAAATTGAAAATGAATATGAGGTTTTATATTTAA 

DNILSKIENEYEVLYL 
151 AACCTTTAGCAGGTGTTTATAGAAGTTTAAAAAAACAATTAGA7UU\TAAC 

KPLAGVYRSL KKQLENN 
201 GTTATGACATTTAATGTTAATGTTAAGGATATTTTAAATTCACGATTTAA 
VMTFNVNVKDILNSRFN 
251 TAAACGTGAAAATTTCAAAAATGTTTTAGAATCAGATTTAATTCCATATA 

KRENFKNV LESDLI PY 
301 AAGATTTAACATCAAGTAATTATGTTGTCAAAGATCCATATAAATTTCTT 

K DLTSSNYVVKDPYKFL 
351 AATAAAGAAAAAAGAGATAAATTCTTAAGCAGTTATAATTATATTAAGGA 
NKEKRDKF LSSYNY IKD 
4 01 TTCAATAGATACGGATATAAATTTTGCAAATGATGTTCTTGGATATTATA 
SIDTDINFANDVLGYY 

4 51 AAATATTATCCGAAAAATATAAATCAGATTTAGATTCAATTAAAAAATAT 

KILSEKYKSDLDSIKKY 
501 ATC/SlACGACAAACAAGGTG/WUUVTGAGAAATACCTTCCCTTTTTAAACAA 
INDKQGENEKYLPFLNN 

5 51 TATTGAGACCTTATATAAAACAGTTAATGATAAAATTGATTTATTTGTAA 

lETLYKTVNDKIDLFV 
601 TTCATTTAGAAGCAAAAGTTCTAAATTATACATATGAGAAATCA7\ACGTA 

IHLEAKVLNYTYEKSN-V 
651 GTVAGTTAAAATAAAAGAACTTAATTACTTAAAAACAATTCAAGAC/^AATT 
EVKIKELNYLKTIQ DKL 
7 01 GGCAGATTTTA7^AAAAAATAACAATTTCGTTGGAATTGCTGATTTATCA0 

ADFKKNNNFVGIADLS 
7 51 CAGATTATAACCATAAT7\ACTTATTGAC7\AAGTTCCTTAGTACAGGT ATG 
TDYNHNNLLTKFLSTGM 
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8 01 GTTTTTGAAAATCTTGCTAAAACCGTTTTATCTAATTTACTTGATGGAAA 
VFENLAKTVLSNLLDGN 
8 51 CTTGCAAGGTATGTTAAACATTTCACAACACCAATGCGTAAAAAAACAAT 

L QGMLNISQHQCVKKQ 
901 GTCCACAAAATTCTGGATGTTTCAGACATTTAGATGAAAGAGAAGAATGT 

CPQNSGCFRHLDEREEC 
951 AAATGTTTATTAAATTACAAACAAGAAGGTGATAAATGTGTTGAAAATCC 
KCLL NYKQEGDKCVENP 
1001 AAATCCTACTTGTAACGAAAATAATGGTGGATGTGATGCAGATGCCAAAT 

NPTCNENNGGCDADAK 
1051 GTACCGAAGAAGATTCAGGTAGCAACGGAAAGAATUVTCACATGTGAATGT 

CTEEDSGSNGKKITCEC 
1101 ACTAAACCTGATTCTTATCCACTTTTCGATGGTATTTT CTGCAG TCATCA 
TKPDSY PLFDGIFCSHH 
1151 TCATCATCATCATTAATA AGGTACC 
H H H H * * 

Underlined sequences represent restriction sites. 

Bold letters represent alterations done to the leader sequence as described 
in the methods. 

The boxed letter represents the original sequence where a mis-sense 
mutation to a cytosine occurred. 
"*" represent stop codons. 
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